Modeling Review Spam Using Temporal Patterns and Co-bursting Behaviors
نویسندگان
چکیده
Online reviews play a crucial role in helping consumers evaluate and compare products and services. However, review hosting sites are often targeted by opinion spamming. In recent years, many such sites have put a great deal of effort in building effective review filtering systems to detect fake reviews and to block malicious accounts. Thus, fraudsters or spammers now turn to compromise, purchase or even raise reputable accounts to write fake reviews. Based on the analysis of a real-life dataset from a review hosting site (dianping.com), we discovered that reviewers’ posting rates are bimodal and the transitions between different states can be utilized to differentiate spammers from genuine reviewers. Inspired by these findings, we propose a two-mode Labeled Hidden Markov Model to detect spammers. Experimental results show that our model significantly outperforms supervised learning using linguistic and behavioral features in identifying spammers. Furthermore, we found that when a product has a burst of reviews, many spammers are likely to be actively involved in writing reviews to the product as well as to many other products. We then propose a novel co-bursting network for detecting spammer groups. The co-bursting network enables us to produce more accurate spammer groups than the current state-of-the-art reviewer-product (coreviewing) network.
منابع مشابه
Bimodal Distribution and Co-Bursting in Review Spam Detection
Online reviews play a crucial role in helping consumers evaluate and compare products and services. This critical importance of reviews also incentivizes fraudsters (or spammers) to write fake or spam reviews to secretly promote or demote some target products and services. Existing approaches to detecting spam reviews and reviewers employed review contents, reviewer behaviors, star rating patte...
متن کاملModeling of Air Pollutants’ Dispersion by Means of CALMET/CALPUFF (Case Study: District 7 in Tehran city).
The current study aims at modelling the dispersion of two pollutants, namely CO (carbon monoxide) and SO2 (sulfur dioxide) released from District 7 of Tehran Municiaplity, from 20 main line sources, by means of CALPUFF modeling system. CALPUFF is a non-steady state puff modeling software which employs meteorological, terrain, and land-use data to effectively simulate air pollutants' dispersion ...
متن کاملModeling of Air Pollutants’ Dispersion by Means of CALMET/CALPUFF (Case Study: District 7 in Tehran city).
The current study aims at modelling the dispersion of two pollutants, namely CO (carbon monoxide) and SO2 (sulfur dioxide) released from District 7 of Tehran Municiaplity, from 20 main line sources, by means of CALPUFF modeling system. CALPUFF is a non-steady state puff modeling software which employs meteorological, terrain, and land-use data to effectively simulate air pollutants' dispersion ...
متن کاملUnderstanding Temporal Human Mobility Patterns in a City by Mobile Cellular Data Mining, Case Study: Tehran City
Recent studies have shown that urban complex behaviors like human mobility should be examined by newer and smarter methods. The ubiquitous use of mobile phones and other smart communication devices helps us use a bigger amount of data that can be browsed by the hours of the day, the days of the week, geographic area, meteorological conditions, and so on. In this article, mobile cellular data mi...
متن کاملAnalyzing and Detecting Opinion Spam on a Large-scale Dataset via Temporal and Spatial Patterns
Although opinion spam (or fake review) detection has attracted significant research attention in recent years, the problem is far from solved. One key reason is that there is no large-scale ground truth labeled dataset available for model building. Some review hosting sites such as Yelp.com and Dianping.com have built fake review filtering systems to ensure the quality of their reviews, but the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1611.06625 شماره
صفحات -
تاریخ انتشار 2016